Picture for Callum McDougall

Callum McDougall

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

Add code
May 28, 2026
Viaarxiv icon

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability

Add code
Mar 13, 2025
Viaarxiv icon

Copy Suppression: Comprehensively Understanding an Attention Head

Add code
Oct 06, 2023
Figure 1 for Copy Suppression: Comprehensively Understanding an Attention Head
Figure 2 for Copy Suppression: Comprehensively Understanding an Attention Head
Figure 3 for Copy Suppression: Comprehensively Understanding an Attention Head
Figure 4 for Copy Suppression: Comprehensively Understanding an Attention Head
Viaarxiv icon